- 01. Introduction
- 02. Lesson Overview
- 03. Why Data Lakes: Evolution of the Data Warehouse
- 04. Why Data Lakes: Unstructured & Big Data
- 05. Why Data Lakes: New Roles & Advanced Analytics
- 06. Big Data Effects: Low Costs, ETL Offloading
- 07. Big Data Effects: Schema-on-Read
- 08. Big Data Effects: (Un-/Semi-)Structured Support
- 09. Demo: Schema-on-Read Pt 1
- 10. Demo: Schema-on-Read Pt 2
- 11. Demo: Schema-on-Read Pt 3
- 12. Demo: Schema-on-Read Pt 4
- 13. Exercise 1: Schema-on-Read
- 14. Demo: Advanced Analytics NLP Pt 1
- 15. Demo: Advanced Analytics NLP Pt 2
- 16. Demo: Advanced Analytics NLP Pt 3
- 17. Exercise 2: Advanced Analytics NLP
- 18. Data Lake Implementation Introduction
- 19. Data Lake Concepts
- 20. Data Lake vs Data Warehouse
- 21. Data Lake Options on AWS
- 22. AWS Options: EMR (HDFS + Spark)
- 23. AWS Options: EMR (S3 + Spark)
- 24. AWS Options: Athena
- 25. Demo: Data Lake on S3 Pt 1
- 26. Demo: Data Lake on S3 Pt 2
- 27. Exercise 3: Data Lake on S3
- 28. Demo: Data Lake on EMR Pt 1
- 29. Demo: Data Lake on EMR Pt 2
- 30. Demo: Data Lake on Athena Pt 1
- 31. Demo: Data Lake on Athena Pt 2
- 32. Data Lake Issues
- 33. [AWS] Launch EMR Cluster and Notebook
- 34. [AWS] Avoid Paying Unexpected Costs